Corpus: rus-tj_web_2016_30K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 92 97 98 98 98
1000 734 808 836 844 848
10000 6432 8910 9431 9570 9657
100000 15592 25328 28080 28829 29049
1000000 15592 25328 28080 28829 29049


Zipf's diagram for sentence endings


Gnuplot diagram

5177 msec needed at 2018-06-12 22:41